NSF PAR Search | NSF Public Access Repository

Rectifying Privacy and Efficacy Measurements in Machine Unlearning: A New Inference Attack Perspective

Naderloui, Nima; Yan, Shenao; Wang, Binghui; Fu, Jie; Wang, Wendy Hui; Liu, Weiran; Hong, Yuan (August 2025, USENIX Security)

Free, publicly-accessible full text available August 13, 2026
Synthesis of Dynamic Masks for Information-Theoretic Opacity in Stochastic Systems

Udupa, Sumukha; Shi, Chongyang; Fu, Jie (May 2025, 16th ACM/IEEE International Conference on Cyber-Physical Systems)

Free, publicly-accessible full text available May 6, 2026
Adaptive Incentive Design for Markov Decision Processes with Unknown Rewards

Ma, Haoxiang; Han, Shuo; Hemida, Ahmed; Kamhoua, Charles; Fu, Jie (March 2025, Transactions on Machine Learning Research)

Free, publicly-accessible full text available March 28, 2026
Adaptive Incentive Design for Markov Decision Processes with Unknown Rewards

Ma, Haoxiang; Han, Shuo; Hemida, Ahmed; Kamhoua, Charles A; Fu, Jie (March 2025, Transactions on Machine Learning Research)
Poupart, Pascal (Ed.)
Incentive design, also known as model design or environment design for Markov decision processes (MDPs), refers to a class of problems in which a leader can incentivize his follower by modifying the follower's reward function, in anticipation that the follower's optimal policy in the resulting MDP can be desirable for the leader's objective. In this work, we propose gradient-ascent algorithms to compute the leader's optimal incentive design, despite the lack of knowledge about the follower's reward function. First, we formulate the incentive design problem as a bi-level optimization problem and demonstrate that, by the softmax temporal consistency between the follower's policy and value function, the bi-level optimization problem can be reduced to single-level optimization, for which a gradient-based algorithm can be developed to optimize the leader's objective. We establish several key properties of incentive design in MDPs and prove the convergence of the proposed gradient-based method. Next, we show that the gradient terms can be estimated from observations of the follower's best response policy, enabling the use of a stochastic gradient-ascent algorithm to compute a locally optimal incentive design without knowing or learning the follower's reward function. Finally, we analyze the conditions under which an incentive design remains optimal for two different rewards that are policy invariant. The effectiveness of the proposed algorithm is demonstrated using a small probabilistic transition system and a stochastic gridworld.
Free, publicly-accessible full text available March 28, 2026
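
The "softmax temporal consistency" mentioned in the abstract above can be illustrated with a small sketch. It is not the paper's algorithm. It assumes the standard entropy-regularized form, V(s) = tau * log sum_a exp(Q(s,a) / tau) and pi(a|s) = exp((Q(s,a) - V(s)) / tau), and every name here (`soft_follower_response`, `with_incentive`, the two-state MDP, the incentive value) is hypothetical and chosen only for illustration.

```python
import math

def logsumexp(xs):
    # Numerically stable log(sum(exp(x))) over a list of floats.
    m = max(xs)
    return m + math.log(sum(math.exp(x - m) for x in xs))

def q_values(P, R, V, gamma):
    # P[s][a] is a list of (next_state, probability); R[s][a] is a reward.
    return [[R[s][a] + gamma * sum(p * V[t] for t, p in P[s][a])
             for a in range(len(P[s]))] for s in range(len(P))]

def soft_follower_response(P, R, gamma=0.9, tau=1.0, iters=500):
    # Soft value iteration (assumed follower model, not taken from the paper).
    V = [0.0] * len(P)
    for _ in range(iters):
        Q = q_values(P, R, V, gamma)
        V = [tau * logsumexp([q / tau for q in row]) for row in Q]
    # Final pass so that the policy is exactly consistent with (Q, V).
    Q = q_values(P, R, V, gamma)
    V = [tau * logsumexp([q / tau for q in row]) for row in Q]
    pi = [[math.exp((q - V[s]) / tau) for q in Q[s]] for s in range(len(P))]
    return V, pi

def with_incentive(R, s, a, x):
    # Leader's modification: add a side payment x to the reward of (s, a).
    R2 = [row[:] for row in R]
    R2[s][a] += x
    return R2

# Hypothetical MDP: state 0 has two actions, both leading to state 1;
# state 1 has one action leading back to state 0.
P = [[[(1, 1.0)], [(1, 1.0)]], [[(0, 1.0)]]]
R = [[1.0, 0.0], [0.0]]  # follower initially prefers action 0 in state 0

_, pi_base = soft_follower_response(P, R)
_, pi_inc = soft_follower_response(P, with_incentive(R, 0, 1, 2.0))
print(round(pi_base[0][1], 3), round(pi_inc[0][1], 3))  # about 0.269 and 0.731
```

In this toy case both actions in state 0 lead to the same successor, so the follower's action probabilities depend only on the reward gap, and the incentive shifts the follower toward action 1 without the leader ever solving a nested optimization. The paper's reduction and gradient estimates are more general than this sketch.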
Planning with Probabilistic Opacity and Transparency: A Computational Model of Opaque/Transparent Observations

https://doi.org/10.1109/CDC56724.2024.10886248

Udupa, Sumukha; Fu, Jie (December 2024, IEEE)

Full Text Available
Integrating Contact-Aware CPG System for Learning-Based Soft Snake Robot Locomotion Controllers

https://doi.org/10.1109/TRO.2025.3539173

Liu, Xuan; Onal, Cagdas D; Fu, Jie (January 2025, IEEE Transactions on Robotics)

Full Text Available
Information-Theoretic Opacity-Enforcement in Markov Decision Processes

Shi, Chongyang; Bu, Yuheng; Fu, Jie (August 2024, Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence (IJCAI-24))

Full Text Available
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers

Wei, Cong; Chen, Yang; Chen, Haonan; Hu, Hexiang; Zhang, Ge; Fu, Jie; Ritter, Alan; Chen, Wenhu (October 2024, ECCV 2024)

Full Text Available
Opacity-enforcing active perception and control against eavesdropping attacks

Udupa, Sumukha; Rahmani, Hazhar; Fu, Jie (December 2023, Conference on Decision and Game Theory for Security)

Full Text Available
Active Perception With Initial-State Uncertainty: A Policy Gradient Method

https://doi.org/10.1109/LCSYS.2024.3513896

Shi, Chongyang; Han, Shuo; Dorothy, Michael; Fu, Jie (January 2024, IEEE Control Systems Letters)

Full Text Available
